13 research outputs found

    The Virtual Block Interface: A Flexible Alternative to the Conventional Virtual Memory Framework

    Full text link
    Computers continue to diversify with respect to system designs, emerging memory technologies, and application memory demands. Unfortunately, continually adapting the conventional virtual memory framework to each possible system configuration is challenging, and often results in performance loss or requires non-trivial workarounds. To address these challenges, we propose a new virtual memory framework, the Virtual Block Interface (VBI). We design VBI based on the key idea that delegating memory management duties to hardware can reduce the overheads and software complexity associated with virtual memory. VBI introduces a set of variable-sized virtual blocks (VBs) to applications. Each VB is a contiguous region of the globally-visible VBI address space, and an application can allocate each semantically meaningful unit of information (e.g., a data structure) in a separate VB. VBI decouples access protection from memory allocation and address translation. While the OS controls which programs have access to which VBs, dedicated hardware in the memory controller manages the physical memory allocation and address translation of the VBs. This approach enables several architectural optimizations to (1) efficiently and flexibly cater to different and increasingly diverse system configurations, and (2) eliminate key inefficiencies of conventional virtual memory. We demonstrate the benefits of VBI with two important use cases: (1) reducing the overheads of address translation (for both native execution and virtual machine environments), as VBI reduces the number of translation requests and associated memory accesses; and (2) two heterogeneous main memory architectures, where VBI increases the effectiveness of managing fast memory regions. For both cases, VBI significanttly improves performance over conventional virtual memory

    POLCA: Power Oversubscription in LLM Cloud Providers

    Full text link
    Recent innovation in large language models (LLMs), and their myriad use-cases have rapidly driven up the compute capacity demand for datacenter GPUs. Several cloud providers and other enterprises have made substantial plans of growth in their datacenters to support these new workloads. One of the key bottleneck resources in datacenters is power, and given the increasing model sizes of LLMs, they are becoming increasingly power intensive. In this paper, we show that there is a significant opportunity to oversubscribe power in LLM clusters. Power oversubscription improves the power efficiency of these datacenters, allowing more deployable servers per datacenter, and reduces the deployment time, since building new datacenters is slow. We extensively characterize the power consumption patterns of a variety of LLMs and their configurations. We identify the differences between the inference and training power consumption patterns. Based on our analysis of these LLMs, we claim that the average and peak power utilization in LLM clusters for inference should not be very high. Our deductions align with the data from production LLM clusters, revealing that inference workloads offer substantial headroom for power oversubscription. However, the stringent set of telemetry and controls that GPUs offer in a virtualized environment, makes it challenging to have a reliable and robust power oversubscription mechanism. We propose POLCA, our framework for power oversubscription that is robust, reliable, and readily deployable for GPU clusters. Using open-source models to replicate the power patterns observed in production, we simulate POLCA and demonstrate that we can deploy 30% more servers in the same GPU cluster for inference, with minimal performance los

    Hybrid Computing for Interactive Datacenter Applications

    Full text link
    Field-Programmable Gate Arrays (FPGAs) are more energy efficient and cost effective than CPUs for a wide variety of datacenter applications. Yet, for latency-sensitive and bursty workloads, this advantage can be difficult to harness due to high FPGA spin-up costs. We propose that a hybrid FPGA and CPU computing framework can harness the energy efficiency benefits of FPGAs for such workloads at reasonable cost. Our key insight is to use FPGAs for stable-state workload and CPUs for short-term workload bursts. Using this insight, we design Spork, a lightweight hybrid scheduler that can realize these energy efficiency and cost benefits in practice. Depending on the desired objective, Spork can trade off energy efficiency for cost reduction and vice versa. It is parameterized with key differences between FPGAs and CPUs in terms of power draw, performance, cost, and spin-up latency. We vary this parameter space and analyze various application and worker configurations on production and synthetic traces. Our evaluation of cloud workloads shows that energy-optimized Spork is not only more energy efficient but it is also cheaper than homogeneous platforms--for short application requests with tight deadlines, it is 1.53x more energy efficient and 2.14x cheaper than using only FPGAs. Relative to an idealized version of an existing cost-optimized hybrid scheduler, energy-optimized Spork provides 1.2-2.4x higher energy efficiency at comparable cost, while cost-optimized Spork provides 1.1-2x higher energy efficiency at 1.06-1.2x lower cost.Comment: 13 page

    A Stress Induced Source of Phonon Bursts and Quasiparticle Poisoning

    Full text link
    The performance of superconducting qubits is degraded by a poorly characterized set of energy sources breaking the Cooper pairs responsible for superconductivity, creating a condition often called "quasiparticle poisoning." Recently, a superconductor with one of the lowest average quasiparticle densities ever measured exhibited quasiparticles primarily produced in bursts which decreased in rate with time after cooldown. Similarly, several cryogenic calorimeters used to search for dark matter have also observed an unknown source of low-energy phonon bursts that decrease in rate with time after cooldown. Here, we show that a silicon crystal glued to its holder exhibits a rate of low-energy phonon events that is more than two orders of magnitude larger than in a functionally identical crystal suspended from its holder in a low-stress state. The excess phonon event rate in the glued crystal decreases with time since cooldown, consistent with a source of phonon bursts which contributes to quasiparticle poisoning in quantum circuits and the low-energy events observed in cryogenic calorimeters. We argue that relaxation of thermally induced stress between the glue and crystal is the source of these events, and conclude that stress relaxation contributes to quasiparticle poisoning in superconducting qubits and the athermal phonon background in a broad class of rare-event searches.Comment: 13 pages, 6 figures. W. A. Page and R. K. Romani contributed equally to this work. Correspondence should be addressed to R. K. Roman

    LEARN: A multi-centre, cross-sectional evaluation of Urology teaching in UK medical schools

    Get PDF
    OBJECTIVE: To evaluate the status of UK undergraduate urology teaching against the British Association of Urological Surgeons (BAUS) Undergraduate Syllabus for Urology. Secondary objectives included evaluating the type and quantity of teaching provided, the reported performance rate of General Medical Council (GMC)-mandated urological procedures, and the proportion of undergraduates considering urology as a career. MATERIALS AND METHODS: LEARN was a national multicentre cross-sectional study. Year 2 to Year 5 medical students and FY1 doctors were invited to complete a survey between 3rd October and 20th December 2020, retrospectively assessing the urology teaching received to date. Results are reported according to the Checklist for Reporting Results of Internet E-Surveys (CHERRIES). RESULTS: 7,063/8,346 (84.6%) responses from all 39 UK medical schools were included; 1,127/7,063 (16.0%) were from Foundation Year (FY) 1 doctors, who reported that the most frequently taught topics in undergraduate training were on urinary tract infection (96.5%), acute kidney injury (95.9%) and haematuria (94.4%). The most infrequently taught topics were male urinary incontinence (59.4%), male infertility (52.4%) and erectile dysfunction (43.8%). Male and female catheterisation on patients as undergraduates was performed by 92.1% and 73.0% of FY1 doctors respectively, and 16.9% had considered a career in urology. Theory based teaching was mainly prevalent in the early years of medical school, with clinical skills teaching, and clinical placements in the later years of medical school. 20.1% of FY1 doctors reported no undergraduate clinical attachment in urology. CONCLUSION: LEARN is the largest ever evaluation of undergraduate urology teaching. In the UK, teaching seemed satisfactory as evaluated by the BAUS undergraduate syllabus. However, many students report having no clinical attachments in Urology and some newly qualified doctors report never having inserted a catheter, which is a GMC mandated requirement. We recommend a greater emphasis on undergraduate clinical exposure to urology and stricter adherence to GMC mandated procedures

    Multi objective dynamic optimization study of butylated urea formaldehyde resin process in a batch reactor

    No full text
    Butylated urea formaldehyde (BUF) is a key intermediate for manufacturing paint and coating. The quality of BUF resins can be measured in terms of the concentration of free formaldehyde in the BUF resins and the extent of butylation. We in this work present an optimal control study to obtain minimum free formaldehyde concentration and minimum butanol concentration at the end of the batch operation. Reactor temperature is used as the manipulated variable and optimum temporal reactor temperature profiles are obtained using control vector parameterization approach. The two aforementioned criteria are observed to be mutually conflicting and hence the multi-objective optimal control problem is solved in this work yielding the pareto optimal curve showing the trade-off solutions of the MOO problem. Such pareto optimal curve helps the operator to choose an operating condition for a desired operation.by Garima Patel, Shital Amin, Nitin Padhiyar and Pratyush Daya

    CURRENT ASSESSMENTS REGARDING THE PATHOGENESIS AND TREATMENT STRATEGIES OF ORAL LICHEN PLANUS – A REVIEW

    No full text
    Oral Lichen planus (OLP) is an autoimmune chronic inflammatory disease of mucous membrane. It is mostly CD8+T-cell mediated autoimmune response with unknown etiology and pathogenesis. It generally affects approximately 1% to 2% of the world’s population. OLP affects women more than men at a ratio of approximately 1.4:1. The prevalence of OLP ranges between 0.5% and 3% and in Indian populations it is 2.6%. In recent years, many possible causes regarding the pathogenesis of OLP have been suggested, the exact nature is still unclear. Most data suggests that some specific antigen and nonspecific mechanism are involved. Antigen presentation by basal keratinocytes and antigen-specific keratinocyte killing by CD8+ cytotoxic T-cells are said to be antigen-specific mechanisms. It is still not clear whether antigen is exogenous or endogenous in origin and what specific antigen is responsible for triggering the inflammatory responses. Mast cell degranulation and activation of matrix metalloproteinase (MMP) specific mechanisms. This paper explains how these two mechanism work together and also the current understanding regarding other factors which are responsible for its pathogenesis

    Co-production of syngas and potassium-based fertilizer by solar-driven thermochemical conversion of crop residues

    No full text
    We report on the thermochemical conversion of inedible crop residues using concentrated solar energy as the source of high-temperature process heat. Experiments were performed using a 5 kWth solar packed-bed reactor exposed to radiative fluxes up to 1788 suns. The waste biomass feedstock consisted of unprocessed batches of cotton boll, soybean husk, and black mustard husk and straw, which were pyrolysed and steam-based gasified at nominal temperatures in the range 879–1266 °C, yielding high-quality syngas with molar ratios in the range H2:CO = 1.43–3.25, CO2:CO = 0.28–1.40, and CH4:CO = 0.03–0.28. The solar-to-fuel energy conversion efficiency, defined as the ratio of the heating value of the syngas produced to the solar radiative energy input and the heating value of the feedstock, reached 18%. The heating value of the feedstock was solar-upgraded by 7%, thus outperforming autothermal gasification that typically downgrades by at least 15%. The ash contained 23% potassium. The solar-driven thermochemical process offers a sustainable and efficient path for the conversion of agricultural wastes into valuable fuels and soil fertilizers.ISSN:0378-382
    corecore